Translating Compounds by Learning Component Gloss Translation Models via Multiple Languages
نویسندگان
چکیده
This paper presents an approach to the translation of compound words without the need for bilingual training text, by modeling the mapping of literal component word glosses (e.g. “iron-path”) into fluent English (e.g. “railway”) across multiple languages. Performance is improved by adding component-sequence and learnedmorphology models along with context similarity from monolingual text and optional combination with traditional bilingual-textbased translation discovery.
منابع مشابه
Machine Translation for Languages Lacking Bitext via Multilingual Gloss Transduction
We propose and evaluate a new paradigm for machine translation of low resource languages via the learned surface transduction and paraphrase of multilingual glosses.
متن کاملPolyglot Semantic Parsing in APIs
Traditional approaches to semantic parsing (SP) work by training individual models for each available parallel dataset of text-meaning pairs. In this paper, we explore the idea of polyglot semantic translation, or learning semantic parsing models that are trained on multiple datasets and natural languages. In particular, we focus on translating text to code signature representations using the s...
متن کاملMultilingual eXtended WordNet Knowledge Base: Semantic Parsing and Translation of Glosses
This paper presents a method to create WordNet-like lexical resources for different languages. Instead of directly translating glosses from one language to another, we perform first semantic parsing of WordNet glosses and then translate the resulting semantic representation. The proposed approach simplifies the machine translation of the glosses. The approach provides ready to use semantic repr...
متن کاملOn The Effects of Literal Translation, L1 Glosses and Context, Applied in Reading Comprehension, on Iranian EFL Learners Vocabulary Learning: The Case of Different Proficiencies
This paper aims at discovering and investigating the effect of l1 translation, l1 gloss andcontext on vocabulary learning on EFL Iranian learners. A total number of 120 EFL students inprivate English institutes in Sari participated in the present study. They were divided into twoproficiency groups and three learning conditions. In order to make a list of words unknown to thelearners the partici...
متن کاملWord Representation Models for Morphologically Rich Languages in Neural Machine Translation
Dealing with the co mplex word forms in morphologically rich languages is an open problem in language processing, and is particularly important in translation. In contrast to most modern neural systems of translation, which discard the identity for rare words, in this paper we propose several architectures for learning word representations from character and morpheme level word decompositions. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008